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V. 

ON A THEOEEM OF SYLVESTER'S RELATING TO 
NON-DEGENERATE* MATRICES. 

By Henry Taber, Clark University. 

Presented March 9, 1892. 

In his Memoir on Matrices, Cayley enunciated the theorem 
that the most general matrix commutative with a given matrix was 
a rational integral function of it. Of this theorem Sylvester gave 
a proof, in the Johns Hopkins University Circulars, Vol. III. 
pp. 34 and 57, for the case in which the latent roots of the given 
matrix were all distinct, but pointed out that, when the latent 
roots of the given matrix were not all distinct, the theorem did not 
always hold. Subsequently, in the Comptes Rendus, Vol. XCVIII. 
p. 471, Sylvester stated that he had proved that Cayley's theorem 
held for a matrix of the third order which was not degenerate (de- 
rogatoire), | irrespective of equalities between its latent roots ; and 
he further stated that Cayley's theorem was probably true for non- 
degenerate matrices of any order; i. e. that the most general ma- 
trix commutative with a non-degenerate matrix of any order 
whatever was a rational integral function of it. I shall in this 
paper verify this conjecture of Sylvester's. The theorem follows 
very simply from one given by the author in a note in these Pro- 
ceedings for May 26, 1891, t which contains the complete solution 
of the problem to find the most general matrix commutative with 
a given matrix. 

The theorem is then to be proved for the case in which the latent 
roots of the given matrix are not all distinct. If the distinct la- 
tent roots of the given matrix O are g lt g 2 , g 3 , etc., of multiplicity 

* Sylvester employs the term matrice derogatoire (Comptes Rendus, Vol. 
XCVIII. p. 471) to denote a matrix whose identical equation is of order lower 
than the order of the matrix ; such a matrix I term a degenerate matrix. 

t See preceding note. 

t These Proceedings, Vol. XXVI p 64. 
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m, n, p, etc., respectively, then Q may be written as <o6vr\ where 
o) is a non-vacuous matrix, and 
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in which all the constituents are zero except those in the squares 
On @2> &s> etc., along the principal diagonal; these correspond re- 
spectively to the latent roots g lt g^, g 3 , etc., and are subsidiary 
matrices each of order equal to the multiplicity of the respective 
latent root to which it corresponds. Since O is supposed non-de- 
generate, the characteristics of its latent roots are (m ; 1, 1, ... . 1), 

(«; 1,1, 1), (p; 1,1, 1), etc., respectively. Moreover, 

since the characteristics of g t are (m; 1, 1, 1), 
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In like manner, in the subsidiary matrix 2 the principal diag- 
onal is bordered on the right by unity, and the constituents in the 
principal diagonal are all equal to g 2 , the remaining m 2 — 2 m + 1 
constituents being zero; similarly with respect to the remaining 
squares, 3) etc. 

Let #i, & 2 , etc., denote the matrices obtained from 6 by erasing 
successively all the constituents of except, first, those in the 
square 6 t ; second, those in the square 6 it etc. Then 

6 = & 1 + & i + & i +••■■; 
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and since the &'s are nilfactorial together, i. e. since &% # 2 = & 2 &i 
= 0, etc., therefore 

*=#!* + <>,» + #,»+...'., 

etc., etc. 

Consequently, if f(ff) is any rational integral function of 6 of 
which k is the term independent of 6, and if f(ff) = f(6) — k, 
then 

f(fi) =f(fi) + * =/(*i) + /(*») + /(*•) + •■•• + *. 

Let [r, s] denote the matrix whose constituents are all zero 
except that in the r-th. row and s-th column, which is unity. We 
have then 

*i = 0i 2iT [r, r] + SiT 1 [r, r + 1], • 

#2 = ^2 2i" [i» + r, m + r] + 2,*7 1 [m + »", m + r + 1], 

*« = ^a Si? [m+n+r, m+M+r] + %'T 1 [m+n+r, m+n+r+l~], 
etc., etc. 

By the laws of multiplication of the symbols [r, a], viz. 
[r, «] [«, *] = [r, *], [r, s] \t, o]=0(s$ *), 

it follows that 

&i* = <7i 2 2 ,T [r, r] + 2 ^ S^" 1 [>, f + 1] + 2.V [r, r + 2], 

#x 3 = tf %Tr fr, r] + 3 «fc 2 ^V 1 [r, r + 1] + 3 * ^V [r, r + 2] 

+ Sw 3 [r, t + 3], 

♦,'=ji'S 1 :[v]4ft ,, r , [^i]+!4.'r , ['.'+2] 



•tyi" 1 ' L ' ' J 2dg< 



d"- 1 



+ -- + J^^iy.I g 7=-^^ 1 ' m ^ 



Therefore, if f'{g), f"{g)> etc., denote respectively the first, 
second, etc., differential coefficients of f(g) with respect to g, 
then 
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= vr 1 {^^s 1 v [nr + .]}, 

Similarly, we have 
/(#*) = /(?•) &" |> + r, m + r] + / fo) 2,? [» + r, m + r + 1] 

+ ^>S 1 "7 2 [m + r, m + r+2] + . . . . +^^[ m+ l, „ + „] 

= SoT 1 {"^^^V [» + r, m + r + .] ]■ , 

/(*«) =/(?«) 2i' [» + » + r, m + n + r] ' 

+ f'(9s) ti'7 l [m + n + r, m + n + r + i] 

+ /^)j 1 #-i^ +B + r) m + w + r+2 ] + .... 

/•(P-l) (g \ 

+ \p -i)i C TO + w + *» m + n +*] 

= $ a p l 1 {£^t 1 p 7'tm + n + r, m + n + r + •]}, 
etc., etc. 
And since /'(§>) = /'(<7), /"(<?) = /"(?), etc., and 
1 = Si™ [»", rj + 2v[w + r,m + r~] 

+ 2jr [m + » + r, w + w + r] H , 

it follows that 
/(*) = /(«) + k 

=/(*i) +/(*•) + /(#•) + ••••+* 

+ So'7 1 {"^yp^ Sif [» + n + r, m + n + r + «] J + etc. 

VOL. XXVII. (n. S. XIX.) 4 
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Let now 

/i W = [(* - ^i) ro - to - ffi) m T C(« - 9i) m - to - giVY ••••, 

the remaining factors corresponding to the remaining latent roots ; 
then for g = g lf 

fM =Ato) * 0, fi(g) = 0, f x »ig) = 0, .... f^~\g) = 0; 

for g = g 2 , fi(g) = 0, and likewise its first w — 1 differential 
coefficients; similarly, for g = ^ 3 , /i(y) and its first _p — 1 differ- 
ential coefficients are all zero, etc. Therefore 

It may be shown in like manner that 

Similarly, if 
/ 2 (<?) = [(tf-^)--^-^)-]™ [(0-?,)"- (&-<&)"? • • • • , 
then it may be shown in like manner that 

M\ (fl-^ = 2i"; 1 [»+r,» + r+l], 
4^r (fl - ft)* = Si'T* [m + r, m + r+2], 

/2 W 



4t\ (fi - ft)" 1 = [m + 1, m + »], 



etc., etc. 



OP ARTS AND SCIENCES. 



51 



Now, by reference to the theorem referred to above, it will be 
seen that if <f> is the most general matrix commutative with O, 
then <j> may be written as a> y uT 1 where 



= c 
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Vi 








Vs 



p. 



in which all the constituents are zero except those in the squares 
Vi> V& e * c -> along the principal diagonal, and these square arrays or 
subsidiary matrices are of order equal respectively to the order of 
the subsidiary matrices 6 X , 3> etc., to which they severally corre- 
spond; moreover, 
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a 2 b 2 c 2 rf 2 • • 
a 2 i 2 c 2 • • 
a 2 i 2 . . 
a' g . . 







etc., etc. 
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Therefore, if «j, c 2 , etc., severally denote the matrices formed from 
■q by erasing successively all its constituents except, first, those in 
the square rji, second, those in the square ij 2 > etc., then 

r) = €j + c 2 + , 

and 

«i = OiSiT [r, r] + ftxSi'",- 1 \r, r + 1] + dSr" 2 [r, r + 2] 

+ diSrr 3 [r, r + 3] + 4 2iT* [r, r + 4] + • ■ • • 

« 2 = «2Si* [m + r, mi + r] + fyjSw 1 [m + r, j» + r + 1] 

+ CatiVCm + r, m H- r + 2] + rf 2 2i n 7 3 [»» + »-, m.+ r+3] 
+ e 2 ti n -*[m + r, m + r + 4] + . . . . , 
etc., etc. 

But by what has just been proved it follows that 

£ i = ^^{^+h(e-g 1 )+e l {e-g x y+d l {e-g 1 f+e 1 {6-g i y-> r ....} f 

/ 1 OT 
J2 (ff2) 

etc. Consequently r/ is a rational integral function of 0. Let 
V = f(@)> wnere y(^) denotes the most general rational integral 
function of 6 ; then, since 

<o./(0).a,-l=/(O), 

the most general matrix commutative with O is 

<£ = cor/or 1 = m.f{6) . o)- 1 =/(«)• 

Worcester, Mass., January 2, 1892. 
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POSTSCRIPT. 

Wobcester, Mass., March 26, 1892. 

I find that the solution of the matrical equation Q<f> = <j>Q given 
in a note in these Proceedings, May 26, 1891, and upon which 
the proof of the theorem given above is based, gives the complete 
solution of the more general matrical equation Q, <£ = <p Q' for that 
case in which it may be assumed that the determinant of <j> is not 
null. For this case the latent roots of 12' are identical with those 
of O, and have, respectively, the same characteristics, i. e. the nulli- 
ties of successive powers of Q' less any latent root g are respectively 
equal to the nullities of successive powers of il — g. Therefore, 
if the canonical form of O is u> u>~\ the canonical form of O' is 
m a/ -1 .* Matrices thus related are termed by Weyr matrices of 
the same kind. 

If, then, in the equation 

Q and O' are replaced by their equivalents, it becomes 
that is, 

6 . <a~ ' <j> <o' = <o~ l <f> w . 6. 

If, now, we put 

\j/ = &T" 1 (f> <a') 

the equation becomes 

6ty = x\i6. 

But the most general solution of this equation is given in the note 
above referred to ; the most general matrix commutative with 6 is 
there denoted by rj. Therefore, the most general solution of the 
matrical equation 

<£ n = Q' <f> 
is 

<£ = to if/ &)'"', 

* For explanation of the term canonical form of a matrix, see paper referred 
to, these Proceedings, Vol. XXVI p. 64. 
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if the canonical forms of 12 and 12' are respectively uOur 1 and 
o)' 6 to' -1 , and provided i/r is the most general matrix commutative 
with 6. 

This solution of the equation 12 <£ = <£ 12', as also the above 
mentioned solution of the equation 12 <f> = <f> 12, depends upon the 
reduction of a matrix to its canonical form. In the assumed 
canonical form of 12, viz. <a 6 <o~\ the matrix 6 is dependent solely 
upon the characteristics of the latent roots of 12. Having deter- 
mined the latent roots of 12 and their characteristics, if one value 
of <o is known, the solution of the preceding equation 12 <£ = <j> 12' 
gives in terms of u> every solution of the problem to reduce a given 
matrix 12 to its canonical form. For if 12' = 6, the most general 
solution of the equation 

O $ = <£ 12' 

reduces to 

where \f; is the most general matrix commutative with 0, and m is 
some particular value of <f> satisfying the equation 12 = <j> 6 <fr 1 . 
But since the solutions of this equation are all included in those of 
the equation 12 <f> = <f> 6, if the constants in i/r are so taken that its 
determinant (and consequently the determinant of <£) is not null, 
<£ = <o\jr is the most general solution of the problem to reduce a 
matrix to its canonical form. 

If O and 12' are matrices of the same kind,* and such that for 
any latent root g the increments of nullity of successive powers 
of Q — g (or 12' — g) are severally equal to the nullity of 12 — g 
(or of 12' — g), until a power is reached whose nullity is equal to 
its vacuity, then the most general solution of the equation 
12 <f> = <j> 12' can be expressed in terms of certain arbitrary matrices 
and of products of powers of 12 and 12' less their latent roots. Thus, 

let the latent roots of 12 (and of 12') be g ly g 2 , g„ respectively 

of multiplicity m x , m^, m { ; and suppose that the nullities of 

12 — g lf 12 — g 2 , etc. are m\, m' 2 , etc. From the supposed equality 
in the increments of nullity of 12 — g v etc., it follows that 



are all integers. Let, now, 
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* That is, a and fl' are such that it is possible to find a matrix \ *° satisfy 
the equation D.' — % n X~ '• 
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fM = (Tlr* - l^g^r ii^r - gT^J*)** .... 

ii=- 9 r - -jZ^r- 1 iF^gr - gT^ST* 1 •••• 

n -fM a -££*> 

r ~fri9rY r ~figr)' 

Then the most general expression for the matrix <f> satisfying the 
equation O <£ = <£ 12' is 

* = So^-HC" - yi) w >~ r_1 Oi Mi «i («' - ffl y} 

+ 2o^ -1 {(n - ff^Y'-'- 1 Qi m 2 o' 2 (o' - 9i y) 

+ .... + ^"'{(O - SV)*'-- 1 ^ M fi', (O' - fl),}, 

where M u M 2 , .... M ( , are arbitrary matrices. 

If Q' = 0, the expression to which this expression for <f> reduces 
gives the most general matrix commutative with Q. 

If O' = 6, and the matrices M in the above expression for <£ are 
so taken (as is possible) that the determinant of <f> is not null, then 
the expression for <j> gives the most general solution of the reduc- 
tion of the given matrix O to its canonical form. 



